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Abstract 

We consider the class of packing integer programs (PIPs) that are column sparse, where there is 
a specified upper bound fc on the number of constraints that each variable appears in. We give an 
improved (efc + o(fc))-approximation algorithm for fc-column sparse PIPs. Our algorithm is based 
on a linear programming relaxation, and involves randomized rounding combined with alteration. 
We also show that the integrality gap of our LP relaxation is at least 2k — 1; it is known that even 
special cases of fc-column sparse PIPs are f2(j^:)-hard to approximate. 

We generalize our result to the case of maximizing monotone submodular functions over fc- 
column sparse packing constraints, and obtain an (jirzj + o(fc)J -approximation algorithm. In ob- 
taining this result, we prove a new property of submodular functions that generalizes the fractionally 
subadditive property, which might be of independent interest. 

When the capacities of all constraints are large relative to the sizes, we obtain substantially better 
guarantees for these fc-column sparse packing problems; again our result is tight (up to constant 
factors) relative to the natural LP relaxation. 

1 Introduction 

Packing integer programs (PIPs) are those of the form: 

max {w T x | Sx < c, x G {0, l} n } , where w £ R%, c £ W£ and S £ R+ Xn . 

Above, n is the number of variables/columns, m is the number of rows/constraints, S is the matrix of 
sizes, c is the capacity vector, and w is the weight vector. In general, PIPs are very hard to approximate: 
a special case is the classic independent set problem, which is NP-Hard to approximate within a factor of 
n 1 ~ e ll30l . whereas an n-approximation is trivial. Thus, various special cases of PIPs are often studied. 
Here, we consider k-column sparse PIPs (denoted fc-CS-PIP), which are PIPs where the number of 
non-zero entries in each column of matrix S is at most k. This is a fairly general class and models 
several basic problems such as fc-set packing [flj] and independent set in graphs with degree at most k. 

Recently, in a somewhat surprising result, Pritchard E51 gave an algorithm for fc-CS-PIP where the 
approximation ratio only depends on k; this is useful when k is small. This result is surprising because in 
contrast, no such guarantee is possible for fc-row sparse PIPs. In particular, the independent set problem 
on general graphs is a 2-row sparse PIP, but is n 1 ~°^ 1 )-hard to approximate. Pritchard's algorithm |[25l 
had an approximation ratio of 2 k • k 2 . Subsequently, an improved 0(k 2 ) approximation algorithm was 
obtained independently by Chekuri et al. lfl3l and Chakrabarty-Pritchard iflOl . 
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Our Results: In this paper, we first consider the /c-CS-PIP problem and obtain an (ek + o(k))- 
approximation algorithm for it. Our algorithm is based on solving a strengthened version of the nat- 
ural LP relaxation of /c-CS-PIP, and then performing randomized rounding followed by suitable al- 
terations. In the randomized rounding step, we pick each variable independently (according to its LP 
value) and obtain a set of variables with good expected weight; however, some constraints may be vi- 
olated. Then in the alteration step, we drop some variables so as to satisfy all constraints, while still 
having good expected weight. A similar approach can be used with the natural relaxation for /c-CS-PIP 
obtained by simply dropping the integrality constraints on the variables; this gives a slightly weaker 8k- 
approximation bound. However, the analysis of this weaker result is much simpler and we thus present 
it first. To obtain the ek + o(k) bound, we construct a stronger LP relaxation by adding additional valid 
constraints to the natural relaxation for /c-CS-PIP. The analysis of our rounding procedure is based on 
exploiting these additional constraints and using the positive correlation between various probabilistic 
events via the FKG inequality. 

Our result is almost the best possible that one can hope for using the LP based approach. We show 
that the integrality gap of the strengthened LP is at least 2k — 1, so our analysis is tight up to a small 
constant factor e/2 « 1.36 for large values of k. Even without restricting to LP based approaches, an 
0(k) approximation is nearly best possible since it is NP-Hard to obtain an o(k/ log /^-approximation 
for the special case of fc-set packing IflTl . We also obtain improved results for /c-CS-PIP when capacities 
are large relative to the sizes. In particular, we obtain a 0(/c 1 /L B J ) -approximation algorithm for /c-CS- 
PIP, where B := min ig [ n ] jg[ m i Cj/sij measures the relative slack between the capacities c and sizes S. 
We also show that this result is tight up to constant factors relative to its LP relaxation. 

Our second main result is for the more general problem of maximizing a monotone submodular 
function over packing constraints that are /c-column sparse. This problem is a common generalization of 
maximizing a submodular function over (a) a /c-dimensional knapsack EH, and (b) the intersection of 
k partition matroids Il24l . Here, we obtain an ^f^j + o(k) S j -approximation algorithm for this problem. 
Our algorithm uses the continuous greedy algorithm of Vondrak |29l in conjunction with our randomized 
rounding plus alteration based approach. However, it turns out that the analysis of the approximation 
guarantee is much more intricate: In particular, we need a generalization of a result of Feige lfT51 that 
shows that submodular functions are also fractionally subadditive. See Section [3] for a statement of the 
new result, Theorem 13.31 and related context. This generalization is based on an interesting connection 
between submodular functions and the FKG inequality. We believe that this result and technique might 
be of further use in the study of submodular optimization. 

Related Previous Work: Various special cases of /c-CS-PIP have been extensively studied. An im- 
portant special case is the k-set packing problem, where given a collection of sets of cardinality at most 
k, the goal is to find the maximum weight sub-collection of mutually disjoint sets. This is equivalent 
to /c-CS-PIP where the constraint matrix S is 0-1 and the capacity c is all ones. Note that for k = 2 
this is maximum weight matching which can be solved in polynomial time, and for k = 3 the problem 
becomes APX-hard lfl7l . After a long line of work |[T8l l2l[TTll8i. the best-known approximation ratio 
for this problem is + e obtained using local search techniques (8). An improved bound of | + e is 
also known |[T8l for the unweighted case, i.e., the weight vector w = 1. It is also known that the natural 
LP relaxation for this problem has integrality gap at least k — 1 + 1/k, and in particular this holds for 
the projective plane instance of order k — 1. Hazan et al. ifTTl showed that /c-set packing is r2(^|^)-hard 
to approximate. 

Another special case of /c-CS-PIP is the independent set problem in graphs with maximum de- 
gree at most k. This is equivalent to /c-CS-PIP where the constraint matrix S is 0-1, capacity c is all 
ones, and each row is 2-sparse. This problem has an 0(/c log log /c/ log /^-approximation lfT6ll . and is 
VL{k/ log 2 /c)-hard to approximate 0, assuming the Unique Games Conjecture lTT9l . 

Shepherd and Vetta Il26l studied the demand matching problem on graphs, which is /c-CS-PIP with 
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k = 2, with the further restriction that in each column the non-zero entries are equal, and that no two 
columns have non-zero entries in the same two rows. They gave an LP-based 3.264-approximation 
algorithm E6l . and showed that the natural LP relaxation for this problem has integrality gap at least 
3. They also showed the demand matching problem to be APX-hard even on bipartite graphs. For 
larger values of k, problems similar to demand matching have been studied under the name of column- 
restricted PIPs l20l . which arise in the context of routing flow unsplittably (see also (5j|6l). In particular, 
an 11.54A;-approximation algorithm was known lfT4l where (i) in each column all non-zero entries are 
equal, and (ii) the maximum entry in S is at most the minimum entry in c (this is also known as the no 
bottle-neck assumption); later, it was observed in [12] that even without the second of these conditions, 
one can obtain an 8A; approximation. The literature on unsplittable flow is quite extensive; we refer the 
reader to (4j [121 and references therein. 

For the general /c-CS-PIP, Pritchard [25 ] gave a 2 fc /^-approximation algorithm, which was the first 
result with approximation ratio depending only on k. Pritchard's algorithm was based on solving an 
iterated LP relaxation, and then applying a randomized selection procedure. Independently, lfT3l and 
ifTOll showed that this final step could be derandomized, yielding an improved bound of 0(k 2 ). All these 
previous results crucially use the structural properties of basic feasible solutions of the LP relaxation. 
However, as stated above, our result is based on randomized rounding with alterations and does not use 
properties of basic solutions. This is crucial for the submodular maximization version of the problem, 
as a solution to the fractional relaxation there does not have these properties. 

We remark that randomized rounding with alteration has also been used earlier by Srinivasan E8l 
in the context of PIPs. However, the focus of this paper is different from ours; in previous work ll27l . 
Srinivasan had bounded the integrality gap for PIPs by showing a randomized algorithm that obtained 
a "good" solution (one that satisfies all constraints) with positive — but perhaps exponentially small — 
probability. In ll28l . he proved that rounding followed by alteration leads to an efficient and parallelizable 
algorithm; the rounding gives a "solution" of good value in which most constraints are satisfied, and 
one can alter this solution to ensure that all constraints are satisfied. (We note that ll27l l28l also gave 
derandomized versions of these algorithms.) 

Related issues have been considered in discrepancy theory, where the goal is to round a fractional so- 
lution to a /c-column sparse linear program so that the capacity violation for any constraint is minimized. 
A celebrated result of Beck-Fiala Q shows that the capacity violation is at most 0(k). A major open 
question in discrepancy theory is whether the above bound can be improved to 0(y/k), or even 0(k 1 ~ € ) 
for some e > 0. While the result of |[25l uses techniques similar to that of (7), a crucial difference in 
our problem is that no constraint can be violated at all. In fact, at the end of Section [2l we show another 
crucial qualitative difference between discrepancy and /c-CS-PIP. 

There is a large body of work on constrained maximization of submodular functions; we only cite 
the relevant papers here. Calinescu et al. (9l introduced a continuous relaxation (called the multi-linear 
extension or extension-by-expectation) of submodular functions and subsequently Vondrak f29l gave 
an elegant ^--approximation algorithm for solving this continuous relaxation over any "downward 
monotone" polytope V, as long as there is a polynomial-time algorithm for optimizing linear functions 
over V. We use this continuous relaxation in our algorithm for submodular maximization over fc-sparse 
packing constraints. As noted earlier, £>sparse packing constraints generalize both /c-partition matroids 
and A;-dimensional knapsacks. Nemhauser et al. G4l gave a (k + 1) -approximation for submodular 
maximization over the intersection of k partition matroids; when k is constant, Lee et al. 122 1 improved 
this to k + e. Kulik et al. ETS gave an (^ri + -approximation for submodular maximization over 
/c-dimensional knapsacks when k is constant; if k is part of the input, the best known approximation 
bound is O(k). 

Problem Definition and Notation: Before we begin, we formally describe the /c-CS-PIP problem 
and fix some notation. Let the items (i.e., columns) be indexed by i G [n] and the constraints (i.e., rows) 
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be indexed by j G [m] . We consider the following packing integer program. 




i=l 



Sjj • Xi < cj, V j G [m]; G {0, 1}, Vi G [n] 



We say that item i participates in constraint j if Sjj > 0. For each i G [n], let iV(i) := {j G [m] \ 
sij > 0} be the set of constraints that i participates in. In a £>column sparse PIP, we have |iV(i)| < k 
for each i G [n]. The goal is to find the maximum weight subset of items such that all the constraints are 
satisfied. 

We define the slack as B := minj 6 r re |j 6 r m i Cj/sij. By scaling the constraint matrix, we may assume 
that Cj = 1 for all j G [m]. We also assume that Sjj < 1 for each otherwise, we can just fix Xi = 0. 
Finally, for each constraint j, we let P(j) denote the set of items participating in this constraint. Note 
that |-P(j)| can be arbitrarily large. 

Organization: In Section |2] we begin with the natural LP relaxation, and describe a simple algorithm 
with approximation ratio 8k. We then present a stronger relaxation, and use it to obtain an (e + o(l))k- 
approximation. We also present the integrality gap of 2k — 1 for this strengthened LP, implying that 
our result is almost tight. In Section [3j we describe the ^^-j + o(l)J /c-approximation for fc-column 

sparse packing problems over a submodular objective. Finally, in SectionHJ we deal with the /c-CS-PIP 
problem when the capacities of all constraints are large relative to the sizes, and obtain significantly 
better approximation ratios. Again there is a matching integrality gap up to a constant factor. 

2 Approximation Algorithms for /c-CS-PIP 

Before presenting our algorithm, we describe a (seemingly correct) algorithm that does not quite work. 
Understanding why this easier algorithm fails gives useful insight into the design for the correct algo- 
rithm. 

A strawman Algorithm: Consider the following algorithm. Let x be some optimum solution to the 
natural LP relaxation of A;-CS-PIP (i.e. dropping integrality). For each element i G [n], select it 
independently at random with probability Xi/{2k). Let S be the chosen set of items. For any constraint 
j G [m], if it is violated, then discard all items S n P(j), i.e. items i G S for which > 0. 

Since the probabilities are scaled down by 2k, by Markov's inequality any constraint j is violated 
with probability at most l/(2k). Hence, any constraint will discard its items with probability at most 
1/2 A;. By the /c-sparse property, each element can be discarded by at most k constraints, and hence by 
union bound over those k constraints, it is discarded with probability at most k • (l/2k) = 1/2. Since 
an element is chosen in S with probability Xi/2k, this implies that it lies in the overall solution with 
probability at least Xj/(4/c), implying that the proposed algorithm is a 4k approximation. 

However, the above argument is not correct. Consider the following example. Suppose there is a 
single constraint (and so k = 1), 



where M S> 1 is a large integer. Clearly, setting xi = 1/2 for i = 1, ... ,M is a feasible solution. 
Now consider the execution of the strawman algorithm. Note that whenever item 1 is chosen in S, it 
is very likely that some item other than 1 will also be chosen (since M S> 1 and we pick each item 
independently with probability Xi/2k = 1/4); in this case, item 1 would be discarded. Thus the final 
solution will almost always not contain item 1, violating the claim that it lies in the final solution with 
probability at least x\/4k = 1/8. 



Mxi + x 2 + x 3 + x A + 



. + x M < M 
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The key point is that we must consider the probability of an item being discarded by some constraint, 
conditional on it being chosen in the set S (for item 1 in the above example, this probability is close to 
one, not at most half). This is not a problem if either all item sizes are small (i.e. say Sij < Cj/2), or 
all item sizes are large (say « cj). The algorithm we analyze shows that the difficult case is indeed 
when some constraints contain both large and small items, as in the example above. 

2.1 A Simple Algorithm for /c-CS-PIP 

In this subsection, we use the obvious LP relaxation for /c-CS-PIP (i.e. dropping the integrality condi- 
tion) and obtain an 8/c-approximation algorithm. An item i £ [n] is called big for constraint j £ [m] iff 
Sij > ^; and i is small for constraint j iff < Sjj < \. The algorithm first solves the LP relaxation to 
obtain an optimal fractional solution x. Then we round to an integral solution as follows. With foresight, 
set a = 4. 

1. Sample each item i £ [n] independently with probability Xi/(ak). 
Let S denote the set of chosen items. We call an item in S an 5-item. 

2. For each item i, mark i (for deletion) if, for any constraint j £ N(i), either: 

• S contains some other item i' £ [n] \ {i} which is big for constraint j or 

• The sum of sizes of 5-items that are small for j exceeds 1. (i.e. the capacity). 

3. Delete all marked items, and return S', the set of remaining items. 

Analysis: We will show that this algorithm gives an 8k approximation. 
Lemma 2.1. Solution S' is feasible with probability one. 
Proof. Consider any fixed constraint j £ [m] . 

1. Suppose there is some i' £ S' that is big for j. Then the algorithm guarantees that i' will be the 
only item in S' (either small or big) that participates in constraint j: Consider any other 5-item i 
participating in j; i must have been deleted from S because S contains another item (namely i') 
that is big for constraint j. Thus, i' is the only item in S' participating in constraint j, and so the 
constraint is trivially satisfied, as all sizes < 1. 

2. The other case is when all items in S' are small for j. Let i £ S' be some item that is small for j 
(if there are none such, then constraint j is trivially satisfied). Since i was not deleted from S, it 
must be that the total size of 5-items that are small for j did not exceed 1. Now, S' C S, and so 
this condition is also true for items in S'. 

Thus every constraint is satisfied by solution S' and we obtain the lemma. □ 

We now prove the main theorem. 

Theorem 2.2. For any item i £ [n], the probability Pr[z £ S' \ i £ S] > 1 — — . Equivalently, the 
probability that item i is deleted from S conditional on it being chosen in S is at most 2 /a. 

Proof. For any item i and constraint j £ N(i), let Bij denote the event that i is marked for deletion from 
S because there is some other 5-item that is big for constraint j. Let Gj denote the event that the total 
size of 5-items that are small for constraint j exceeds 1. For any item i £ [n] and constraint j £ N(i), 
we will show that: 

Pr[B y \ ieS] + Pr[Gj | i £ S] < — (1) 
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We prove (Q} using the following intuition: The total extent to which the LP selects items that are big 
for any constraint cannot be more than 2 (each big item has size at least 1/2); therefore, Bjj is unlikely 
to occur since we scaled down probabilities by factor ak. Ignoring for a moment the conditioning on 
i G S, event Gj is also unlikely, by Markov's Inequality. But items are selected for S independently, 
so if i is big for constraint j, then its presence in S does not affect the event Gj at all. If i is small for 
constraint j, then even if i G S, the total size of 5-items is unlikely to exceed 1. 

We now prove CD formally, using some care to save a factor of 2. Let B(j) denote the set of items 
that are big for constraint j, and Yj := YleeB(j) Xi - ^ v tne LP constraint for j, it follows that Yj < 2 
(since each t G B(j) has size S£j > |). Now by a union bound, 

Pr[Be | ie SJ<J_ £ ^<^<^. W 

eeB(j)\{i} 

Now, let G-i(j) denote the set of items that are small for constraint j, not counting item i, even if it 
is small. Using the LP constraint j, we have: 

^ sej ■ xi <1 - ^2 sej-xe<l-^. (3) 
teG_i(>-) ^eB(jf) 

Since each item i' is chosen into S with probability xy j (ak), inequality ([3]) implies that the expected 
total size of 5-items in G-i(j) is at most -K- (1 — Yj/2). By Markov's inequality, the probability that the 
total size of these 5-items exceeds 1/2 is at most \ (1 — Yj/2). Since items are chosen independently 
and i G_j(j), we obtain this probability even conditioned on i S 5. 

If i is big for j, event Gj occurs only if the total size of 5-items in G-i(j) exceeds 1. If i is small 
for j, event Gj occurs only if the total size of small 5-items participating in j exceeds 1; as sij < 1/2, 
the total size of 5-items in G-i(j) must exceed 1/2. Thus, whether i is big or small, 

P „ Gj | i6S] <J.( 1 _5)_J._i. 

Combined with inequality (O we obtain (Q]): 

Pr[Bij \ i£S}+ Pr[Gj \ i £ S] < ^ + ??[Gj \ i£S}<^ + ^--^- = ^-. 

ak ak ak ak ak 

To see that (Q} implies the theorem, for any item i, simply take the union bound over all j E N(i). 
Thus, the probability that i is deleted from S conditional on it being chosen in S is at most 2/ a. Equiv- 
alent^, Pr[t e S' | i e S] > 1 - 21 a. □ 

We are now ready to prove the final result. 

Theorem 2.3. There is a randomized 8k-approximation algorithm for A;-CS-PIP. 

Proof. First observe that our algorithm always outputs a feasible solution (Lemma [2. lb . To bound the 
objective value, recall that Pr[z G S] = ^ for all i G [n]. Hence Theorem [2T2] implies that 

Pr[i G 5'] > Pr[t G «S] • Pr[« G S'\i G S] > ^- ■ ( 1 - - J 

are \ ay 

for all i G [n] . Finally using linearity of expectation and a = 4, we obtain the theorem. □ 
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Remark: We note that the analysis above only uses Markov's inequality conditioned on a single item 
being chosen in set S. Thus a pairwise independent distribution suffices to choose the set S, and hence 
the algorithm can be easily derandomized. 

General upper bounds: The /c-CS-PIP problem as defined assumes all variables to be 0-1. We note that 
our result easily extends to the /c-CS-PIP problem with general upper bounds on variables. Assuming 
an LP-based ^-approximation algorithm for /c-CS-PIP with unit upper-bounds, it is straightforward to 
obtain a (p + 1) -approximation for /c-CS-PIP with general upper-bounds. The algorithm first solves the 
natural LP relaxation to obtain fractional solution y G . Let z G and x G [0, l] n be defined as: 
z i — [Ui\ an d x i = Vi ~ YVi\ f° r ai l i G [n]; note that w T y = w T z + w T x. Clearly z is a feasible 
integral solution. Moreover x is a feasible fractional solution to the same /c-CS-PIP instance even with 
unit upper-bounds. Hence using the rounding algorithm of this subsection, we obtain a feasible integral 
solution x G {0, l} n with w T x > - ■ w T x. It can be seen by simple calculation that the better of z 
and x is a (p + 1) -approximate solution relative to the natural LP relaxation for /c-CS-PIP with general 
upper-bounds. 

2.2 A Stronger LP, and Improved Approximation 

We now present our strengthened LP and the (e/c + o(/c))-approximation algorithm for /c-CS-PIP. 

Stronger LP relaxation. Recall that entries are scaled so that all capacities are one. An item i is called 
big for constraint j iff > 1/2. For each constraint j G [m], let B(j) = {i G [n] \ Sy > ^} denote 
the set of big items. Since no two items that are big for some constraint can be chosen in an integral 
solution, the inequality YlieBQ) < I is valid for each j G [m]. The strengthened LP relaxation that 
we consider is as follows. 

(4) 

Vi G [m] (5) 
Vj G [m]. (6) 

Vi G [n\. (7) 

Algorithm: The algorithm obtains an optimal solution x to the LP relaxation (@]|7]), and rounds it to an 
integral solution S' as follows (parameter a will be set to 1 later). 

1. Pick each item i G [n] independently with probability Xi/(ak). Let S denote the set of chosen 
items. 

2. For any item i and constraint j G N(i), let Ey denote the event that the items {i' G S \ Sj/,- > sy} 
have total size (in constraint j) exceeding one. Mark i for deletion if occurs for any j G N(i). 

3. Return set S' C S consisting of all items i G 5 not marked for deletion. 

Note the rule for deleting an item from S. In particular, whether item i is deleted from constraint j 
only depends on items that are at least as large as i in j. 



> WiXi 

i=i 



£ 

n 

X. ^ ^ Sij • X{ ^ Cj , 

i=\ 
i&B{j) 

< Xi < 1, 
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Analysis: It is clear that S' is feasible with probability one. The main lemma is the following, where 
we show that each item appears in S' with good probability. 

Lemma 2.4. For every item i G [n] and constraint j G N(i), we havePr[Eij \ i G S] < ^ (l + (^:) 1 ^ 3 )- 
Proof. Let i := (&ak) 1 /^ . We classify items in relation to constraints as: 

• Item i G [n] is big for constraint j G [m] if Sjj > |. 

• Item i G [n] is medium for constraint j G [m] if | < Sy < \. 

• Item i G [n] is ft'wy for constraint j G [m] if sij < \. 

For any constraint j G [m], let B(j), M(j),T(j) respectively denote the set of big, medium, tiny 
items for j. In the next three claims, we bound Pr[Eij\i G S] when item i is big, medium, and tiny 
respectively. 

Claim 2.5. For any i G [n] and j G [m] s.t. item i is big for constraint j, Pv[Eij \ i G S] < —j-. 

Proof. The event Eij occurs if some item that is at least as large as i for constraint j is chosen in S. 
Since i is big in constraint j, E^ occurs only if some big item other than i is chosen for S. Now by the 
union bound, the probability that some item from B(j) \ {i} is chosen into S is: 

n[m-)\<i})r\s* t \ ieS ]< E gi^ E 

i'6B(j)\{t} i'6B(j) 

where the last inequality follows from the new LP constraint © on big items for j. □ 
Claim 2.6. For any i G [n], j G [m] item i is medium for constraint j, Pi[Eij \ i G S] < 

Proof. Here, if event E 1 ^ occurs then it must be that either some big item is chosen or (otherwise) 
at least two medium items other than i are chosen, i.e. Eij implies that either Sf]B(j) ^ or 
\S P| (M(j) \ {i}) I > 2. This is because i together with any one other medium item is not enough 
to reach the capacity of constraint j. (Since i is medium, we do not consider tiny items for constraint j 
in determining whether i should be deleted.) 

Just as in Claim 1231 we have that the probability some big item for j is chosen is at most 1/ak, i.e. 

Pr[SnB(j)^0|;eS]<Hfc. 

Now consider the probability that \S f] (M (j) \ {i}) | > 2, conditioned on i G S. We will show that 

this probability is much smaller than 1/ak. Since each item h G M(j) \ {i} is chosen independently 

with probability (even given i G S): 

Pv[\Sf\(M(j)\{i})\>2\ieS 
where the last inequality follows from the fact that 

i > Yl s hj-xh>j Yl Xh 

h€M(j) heM(j) 

(recall each item in M(j) has size at least I). Combining these two cases, we have the desired upper 
bound on Pr[£^ | i G <S]. □ 
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Claim 2.7. For any i G [n], j G [m] s.t. item i is tiny for constraint j, Pr[i?y | i G <S] < -k- (l + i ). 

Proof. Since i is tiny, if event occurs then the total size (in constraint j) of items S \ {i} is greater 
than 1 - \. So, 



PT[Eij | i G S] < Pr 



fte5\{i} 



1 £ 1/2 
< < — I 1 + - 

ak I — 1 ak 



where the first inequality follows from the above observation and the fact that S \ {i} is independent of 
the event i G S, the second is Markov's inequality, and the last uses I > 2. □ 

Thus, for any item i and constraint j G N(i), Pi[Eij \ i G S] < ^max{(l + |), (1 + ^a-)}- 
From the choice of £ = (Aak) 1 ^ 3 , which makes the probability in Claims [Z6l and [2771 equal, we obtain 
the lemma. □ 

We now prove the main result of this section 

Theorem 2.8. For each i G [n], probability Pr[i G <S' | i G S] > (l - ^ (l + (^) 1/3 )) fc - 

Proof. For any item i and constraint j G iV(z), the conditional event (-<Eij \ i G S) is a decreasing 
function over the choice of items in set [n] \ {i}. Thus, by the FKG inequality U, for any fixed item 
i G [n], the probability that no event (Eij \ i G S) occurs is: 



Pr 



[\ -^E i:j \ ieS 
jeJV(i) 



> [] Pr[-.^- | i G S] 

jeN(i) 



From Lemma 12741 Prf-iE 1 ^ | i G S] > 1 - ^ (l + (^)^ 3 )- As each item is in at most k 
constraints, we obtain the theorem. □ 

Now, by setting a = lQ we have Pr[i G S] = 1/k, and Pr[i G S' | i G S] > e+ ^ 1 - ) , which 
immediately implies: 

Theorem 2.9. There is a randomized (ek + o(k)) -approximation algorithm for fc-CS-PIP. 

Remark: We note that this algorithm can be derandomized using conditional expectation and pessimistic 
estimators, since we can compute exactly estimates of the relevant probabilities. Also, using ideas 
from [28] the algorithm can be implemented in RNC. We defer details to the full version. 

Integrality Gap of LP (|4][7]>. Recall that the LP relaxation for the fc-set packing problem has an integral- 
ity gap of k — 1 + 1/k, as shown by the instance given by the projective plane of order k — 1. If we have 
the same size-matrix and set each capacity to 2 — e, this directly implies an integrality gap arbitrarily 
close to 2{k — 1 + 1/k) for the (weak) LP relaxation for /c-CS-PIP. This is because the LP can set each 
xi = (2 — e)/k hence obtaining a profit of (2 — e)(k — 1 + 1/k), while the integral solution can only 
choose one item. However, for our stronger LP relaxation ([4][7]) used in this section, this example does 
not work and the projective plane instance only implies a gap offc — 1 + 1/A; (note that here each item 
is big in every constraint that it appears in). 

However, using a different instance of /c-CS-PIP, we show that even the stronger LP relaxation has 
an integrality gap at least 2k — 1. Consider the instance on n = m = 2k — 1 items and constraints 



Note that this is optimal only asymptotically; in the case of k = 2, for instance, it is better to choose a ~ 2. 
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defined as follows. We view the indices [re] = {0, 1, • • • , re — 1} as integers modulo re. The weights 
wi = 1 for all i £ [re]. The sizes are: 




1 if i = j 

e if J £ {« + !,■■■ 



otherwise 



i + k — 1 (mod n)} 



Vi, j G [n]. 



where e > is arbitrarily small, in particular e <C i. 

Observe that setting Xj = 1 — fee for all i G [re] is a feasible fractional solution to the strengthened 
LP (0]|7]); each constraint has only one big item and so the new constraint © is satisfied. Thus the 
optimal LP value is at least (1 — ke) ■ re « re = 2k — 1. 

On the other hand, we claim that the optimal integral solution can only choose one item and hence 
has value 1. For the sake of contradiction, suppose that it chooses two items i, h € [re]. Then there is 
some constraint j (either j = i or j = h) that implies either x% + e • < 1 or Xh + e • Xi < 1; in either 
case constraint j would be violated. 

Thus the integrality gap of the LP we consider is at least 2k — 1, for every k > 1. 

Bad example for possible generalization. A natural extension of the &-CS-PIP result is to consider 
PIPs where the £i-norm of each column is upper-bounded by k (when capacities are all-ones). We 
observe that unlike /c-CS-PIP, the LP relaxation for this generalization has an Q(n) integrality gap. The 
example has rre = n; sizes sa = 1 for all i £ [n], and = - for all i / j; and all weights one. The 
£i-norm of each column is at most 2. Clearly, the optimal integral solution has value one. On the other 
hand, picking each column to the extent of 1/2 is a feasible LP solution of value re/2. 

This integrality gap is in sharp contrast to the results on discrepancy of sparse matrices, where 
the classic Beck-Fiala bound of 0(k) applies also to matrices with entries in [—1,1], just as well as 
{ — 1, 0, 1} entries; here k denotes an upper-bound on the £i-norm of the columns. 

3 Submodular Objective Functions 

We now consider the more general case when the objective we seek to maximize is an arbitrary monotone 
submodular function f : 2^ — > R + . The problem we consider is: 



As is standard when dealing with submodular functions, we only assume value-oracle access to the 
function: i.e. the algorithm can query any subset T C [re], and it obtains the function value f(T) in con- 
stant time. Again, we let k denote the column-sparseness of the underlying constraint matrix. Observe 
that this problem is a common generalization of maximizing submodular functions over: k partition 
matroids, and k knapsack constraints. In this section we obtain an 0(/c)-approximation algorithm for 
Problem ([8]). The algorithm is similar to that for A;-CS-PIP (where the objective was additive), and 
involves the following two steps. 

1. We first solve (approximately) a suitable continuous relaxation of ([8]). This step follows directly 
from the algorithm of Vondrak |[29l . 

2. Then, using the fractional solution, we perform the randomized rounding with alteration described 
in Section [2] Although the algorithm is the same as for additive functions, the analysis requires 
considerably more work. In the process, we also establish a new property of submodular functions 
that generalizes fractional subadditivity |[T5l . 




(8) 
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Solving the Continuous Relaxation. The extension-by-expectation (also called the multi-linear exten- 
sion) of a submodular function / is a continuous function F : [0, l] n — > R+ defined as follows: 

F(x) := n ieT Xi ■ U^ T (1 - Xj ) ■ f(T) 

TC[ n ] 

Note that F(x) = f(x) for x G {0, l} n and hence F is an extension of /. Even though F is a 
non-linear function, using the continuous greedy algorithm from Vondrak ll29l . we can obtain a (l — -)- 
approximation algorithm to the following fractional relaxation of ®. 

f n \ 

max < F{x) | ■ Xi < cj, Vj G [m]; < Xi < 1, Vi G [n] > (9) 

In order to apply the algorithm from 11291 , one needs to solve in polynomial time the problem of 
maximizing a linear objective over the constraints {X^ILi s ij ' x i < c j; Vj G [m]; < Xj < 1, Vi G 
[n]}. This is indeed possible since it is a linear program on n variables and m constraints. 

The Rounding Algorithm. The rounding algorithm is identical to that for /c-CS-PIP. Let x denote 
any feasible solution to Problem ®. We apply the rounding algorithm for the additive case (from the 
previous section), to first obtain a (possibly infeasible) solution S C [n] and then feasible integral 
solution S' C . In the rest of this section, we prove the performance guarantee of this algorithm. 

Fractional Subaddivity. The following is a useful lemma (see Feige 0JO) showing that submodular 
functions are also fractionally subadditive. 

Lemma 3.1 ( H15II ). Let hi be a set of elements and {At Q U\ be a collection of subsets with non-negative 
weights {Xt} such that Y2t\ieAt ^* — ^f or °U elements i Then, for any submodular fiinction f, we 
have f{U) < Z t Xtf(At). 

The above result can be used to show that (the infeasible solution) S has good profit in expectation. 

Lemma 3.2. For any x G [0, l] n and < p < 1, let set S be constructed by selecting each item i G [n] 
independently with probability p ■ X{. Then, E[f(S)] > pF(x). In particular, this implies that our 
rounding algorithm that forms set S by independently selecting each element i G [n] with probability 
x t /{ak) satisfies E[f(S)] > ^F(x). 

Proof. Consider the following equivalent procedure for constructing S: First, construct So by selecting 
each item i with probability x^. Then construct S by retaining each element in So independently with 
probability p. 

By definition E[f(So)] = F(x). For any fixed set T C [n], consider the outcomes for set S 
conditioned on So = T; the set S C Sq is a random subset such that Pv[i G S \ So = T] = p for all 
i G T. Thus by Lemma [37T1 we have E[f(S) | S = T] > p ■ f(T). Hence: 



E[f(S)}= £ Pr[S = T]-E[f(S)\S = T}> £ Pr[5 = T] -pf(T) = P E[f(S )\ = p-F(x). 

TC[n] TC[n] 

Thus we obtain the lemma. □ 

However, the analysis approach in Theorem 12.81 does not work. The problem is that even though 
S (which is chosen by random sampling) has good expected profit, i.e. E[f(S)] = ^l(^)F(x) (from 
Lemma [3721 above) . it may happen that the alteration step used to obtain S' from S may end up throwing 
away essentially all the profit. This was not an issue for linear objective functions since our alteration 
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procedure guarantees that Pv[i G S'\i G S] = $7(1) for each i G [n], and if / is linear, this implies 
E[f(S)] = S7(l) E[f(S')]. However, this property is not enough for general monotone submodular 
functions. Consider the following: 

Example: Let set S C [n] be drawn from the following distribution: 

• With probability l/2n, S = [n]. 

• For each i G [n], S = {i} with probability l/2n. 

• With probability 1/2 - l/2n, 5 = 0. 

Now define S' = S if S = [n], and S' = otherwise. Note that for each i G [n], we have 
Pr[i G S' | i G 5] = 1/2 = 0(1). However, consider the profit with respect to the "coverage" 
submodular function /, where f(T) = 1 if T ^ and is otherwise. We have E[f(S)] = 1/2 + l/2n, 
but £[/(£')] is only l/2n < £[/(£)]• 

Remark: Note that if 5' itself was chosen randomly from 5 such that Pr[z G S'|<S = T] = fi(l) 
for every T C [n] i 6 T, then we would be done by Lemma I3TT1 Unfortunately, this is too much 
to hope for. In our rounding procedure, for any particular choice of S, set S' is a fixed subset of S; and 
there could be (bad) sets S, where after the alteration step we end up with sets S' such that \S'\ <C \S\. 

However, it turns out that we can use the following two additional properties beyond just marginal 
probabilities to argue that S' has reasonable profit. First, the sets S constructed by our algorithm 
are drawn from a product distribution on the items; in contrast, the example above does not have 
this property. Second, our alteration procedure has the following 'monotonicity' property: Suppose 
T± C Ti C [n], and i G S' when S = T^. Then we are guaranteed that % G S' when S = T\. (That is, 
if S contains additional items, it is more likely that i will be discarded by some constraint it participates 
in.) The above example does not satisfy this property either. That these properties suffice is proved in 
Corollary 13.41 Roughly speaking, the intuition is that, since / is submodular, the marginal contribution 
of item i to 5 is largest when S is "small", and this is also the case when i is most likely to be retained 
for S'. That is, for every i G [n], both Pr[i G S' \ i G S\ and the marginal contribution of i to /(<S) 
are decreasing functions of S. To show Corollary 13.41 we need the following generalization of Feige's 
Subadditivity Lemma. 

Theorem 3.3. Let [n] denote a groundset, x G [0, l] n , and for each B C [n] define p(B) = Ui£B%i • 
rij^B(l — x j)- Associated with each B C [n], there is an arbitrary distribution over subsets of B, where 
each set A C B has probability qsi^A); so Xmcb Qb(A) = lfor all B C [n]. That is, we choose B 
from a product distribution, and then retain a subset A of B by applying a randomized alteration. 

Suppose that the system satisfies the following conditions. 

Marginal Property: 

Vi€[n], Qb(A) > P- W 

BC[n] AQB:i&A BC[n]:ieB 

Monotonicity: For any two subsets B C B' C [n] we have, 

Vi£B, Yl 1b{A) > Y 1B'{A') (11) 

ACB-.ieA A'CB':ieA' 
Then, for any monotone submodular function f, 

Y P(B) Y Qb{A) ■ f(A) > p ■ V{B) ■ f(B). (12) 

BC[n] ACB BC[n] 
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Proof. The proof is by induction on n, the size of the groundset. The base case of n = 1 is straightfor- 
ward. So suppose n > 2. For any subsets A C B C [n] such that n G v4, by submodularity we have that 
f(A) > f{B) - f(B \ {n}) + f(A \ {n}). Applying this, the left-hand-side of CEH> is: 



Y V{B) Y lB(A)f(A) + Y 9B(A)f(A) 

B<Z[n] \ACB:n£A AQB:n(A 



) 



> E^ B ) E <lB(A)-(f(B)-f(B\{n}) + f(A\{n})\+ Y ^(A)f(A) 

BC[n] \ACB:n£A ^ ' ACB:n£A J 

= E E *b{A) ■ f(A \ {n}) + Y P(B) Y Vb{A) ■ (f(B) f(B \ {n})) (13) 

BC[n] ACB BC[n] ACB:nEA ^ ' 

Next, we need the following inequality, 

Y E P(B) ■ Qb(A) ■ f(A \ {n}) > p Y p(B) ■ f(B \ {n}) (14) 

BC[n]ACB BC[n] 

This inequality actually follows by induction, by applying (fT2l) to suitably constructed distributions p 
and q on subsets of [n — 1]. We first complete the proof of the theorem using (fT4l . and prove (TBI ) later. 
We now claim that it suffices to show the following. 

Y P(B) Y 1b{A) ■ (f(B) - f(B \ {«})) > ■ Y P( B ) • (f( B ) ~ f( B \ W)) • (15) 

BC[n] ACB:neA BC[n] 

To see that this suffices, observe that upon adding (fT4l to (PT5T ) we obtain that the right hand side of (fT3b 
is at least the right hand side of dT2b . which will imply the result by (fT3l . We now focus on proving (fl3T ). 

Firstly, note that if x n = then (031 ) is trivially true. In the following we assume x n > 0. 
For any set [n — 1], define the following two functions: 

g(Y) := f(Y U {n}) - f(Y), and fc(y) := ]T q Y u{n}(A U {n}). 

ACY 

Clearly both g and /i are non-negative. Note that g is a decreasing function due to submodularity of /. 
Moreover function h is also decreasing: for any FCY'C [n — 1], 

h(X) = Y QYu{n}(Au{n}) > Y lY'u{n}(A' U {n}) = h(Y'), 

ACY A'CY' 

where the inequality is by the monotonicity condition with i = n, B = Y U {n} and B' = Y' U {n}. 

Consider the product probability space on 2^^ with marginal probabilities given by {a^}^ . For 
any Y C [n — 1], let p'(l^) = IljgyXj • il :? - 6 [ n _ 1 ]\y(l — Xj) denote its probability. Applying the FKG 
inequality [1] on the decreasing functions g and h, it follows that 

Y v'(Y).g{Y).h{Y) > ( Y P'(Y)9(Y)) ■ I E • (16) 

rc[n-i] \yc[n-i] / \yc[n-i] / 



Observe that 



Y P '(Y).h(Y)= Y P(YU x {n}) Y lYu {n} (AU{n}) = -j-Y P^ E ^(A), 

yC[n-l] YC[n-l] U ACY " BC[n] ACB:neA 
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which by the Marginal property with i = n is at least ■ x n = j3. Combining this with (fTBT ). 

J2 p'(Y)-g(Y)-h(Y) > (3 J2 P'( Y )-9(Y). (17) 



KC[n-ll yc[n-l] 



Using the definitions of g and h (and multiplying both sides by x n ), we obtain (|T51) . 

Proof of Inequality (fT4l . We show that this follows by applying Theorem 13.31 suitably on the ground- 
set [n — 1], with marginal probabilities {xi}^ . For each C C [n — 1] define p'(C) = Tli£c%i ■ 
n j€ [ n _ 1 ]\ c <(l — Xj). Associated with each C C [n — 1], let us define the distribution {g^(^4) | A C C} 
over subsets of C as follows: 

x n ' Qcu{n}(A U {n}) + (1 Xn) • qc(A), for all A C C C [n 1]. 
Note that ^M-yicccfn-l] 3c0^) = 1 f° r ever y C C [n — 1], since 

X! 1c( A )= x n- E 9CU{n}(^) + (1 - X n ) ■ E = X n + (1 - X n ) = 1, 

A:ACC A:ACCU{n} A:ACC 

using the fact that J2 A :Accu{n} Qcu{n}(A) = T,a-.acc 1c(A) = 1. 
To see that the Marginal property holds, for any i E [n — 1], we have: 

E ^ E flfcw 

CC[n-l] ACC:ieA 

= E P '( C ^ E ( ;r ™-fcu{n}(^)+2;n-gcu{n}(-4U{n}) + (l-X n )-g C (^)J 

CC[ti-l] ACC:i£A ^ ' 

= e ^'(^ E ^uw(^)+ E (i-^)p'(c) e 1C{A) 

CC[n-l] ACCU{n}:i£A CC[u-l] ACC:ieA 

= e p( Cu W) E fcu { «}(^)+ E ^ E ?<m 

CC[n-l] J 4CCU{n}:ieA CC[n-l] ACC:i€A 

= E E «w ^ p E = E JV)- 

BC[n] ACBiiEA BC[n]:ieB CC[n-l]:ieC 

Above, the inequality is by the Marginal property for the original instance on [n]. 
To show the Monotonicity property for any subsets CCC'C [n — 1], observe that: 

E q 'c( A ) = X n E Ucu{n}{A)+qcu{n}{AU{n})) + (1 - X n ) E <* C ( A> > 
ACC:i£A ACC:i£A ^ ' ACC:i&A 

= X n E 1CU{n}(A') + (1 - X n ) E ^c(^) 

A'CCll{n}:ieA' A<ZC:i&A 

— ^ n / j 9C"U{n} 

A'CC'U{n}:ieA' ACC':ieA 

= E ^(^) 

ACC':i£A 

Again, the inequality is by the monotonicity property on the original instance (on groundset [n]) for the 
pairs C C C and C U {n} CC'U {n}. 
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Finally, we can express the left-hand-side of inequality (fT4b as: 

p(B)J2lB(A)f(A\{n}) 



BC[n] ACB 

= ]T ( P (CU{n}) Y, qcu {n} (A)f(A\{n}) + p(C)Y<lc(A)f(A)) 

CC[n-l] ^ ACCU{n} ACC ' 

J2 Un ' P'(C) (lCU{n}(A') + q C U{n } (A' U {n})) f(A') + (1 - x n )p' (C) £ q C {A') f(A') 

CC[n-l] \ A'CC A'CC 

= E P'(°) E Qc(A')f(A') 

CC[n-l] A'CC 

> /? E = E (P(C)+P(CUW))/(C) = /? £ p(B)f(B\{n}), 

CC[n~l] CC[n-l] BC[n] 

which equals the right-hand-side of (TBI ). Above, the inequality is by the induction hypothesis on the 
instance on [n — 1]. This completes the proof of Inequality (fT4b . and Theorem 13 .3 1 □ 

Remark: It is easy to see that Theorem 13.31 generalizes Lemma |3~T1 Let Xi = 1 for each i £ [n]. 

The distribution |^4j, jj^xjj i s associated with B = [n]. For all other 5' C [n], its distribution has 

qB'(B') = 1. The monotonicity condition is trivially satisfied. By the assumption in Lemma |3~T1 the 
Marginal property holds with /3 = Thus Theorem 13.31 applies and yields the conclusion in 

Lemma [3~T1 

Corollary 3.4. Let S be a random set drawn from a product distribution on [n]. Let S' be another 
random set where for each choice of S, set S' is an arbitrary subset of S. Suppose that for each i € [n] 
the following hold. 

• Pr s [i G S' | i G S] > /3, and 

• For all T\ C Ti with T\ Bi,ifi£ S' when S = T2 then i G S' when S = T\. 
Then E[f(S')] > /3E[f(S)}. 

Proof. This is immediate from Theorem l3.3[ we simply associate the single set distribution (i.e. A = S') 
for each choice B of S. The two conditions stated above on the construction of S 1 imply the Marginal 
and Monotonicity properties respectively; and inequality (fT2l) translates to E[f(S')] > (3E[f(S)]. □ 

We are now ready to prove the performance guarantee of our algorithm. Observe that our rounding 
algorithm satisfies the hypothesis of Corollary 13.41 with /3 = e+ *^ , when parameter a = 1. Moreover, 
by LemmalU it follows that E[f(S)] > F{x)/(ak). Thus, 

Combined with the fact that x is an ^--approximate solution to the continuous relaxation ©, we have 
proved our main result. 

Theorem 3.5. There is a randomized algorithm for maximizing any monotone submodular function over 

2 

k-column sparse packing constraints achieving approximation ratio -^zj^ + o(k). 
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4 /c-CS-PIP Algorithm for general B 



In this section, we obtain substantially better approximation guarantees for /c-CS-PIP when the capaci- 
ties are large relative to the sizes. A useful parameter that measures this is the following (see eg. (271). 

Cj 

B := min — . 

ie[n],je[m] Sij 

We consider the /c-CS-PIP problem as a function of both k and B, and obtain an improved approxi- 
mation ratio of 0(k 1 ^ B ^); we also give a matching integrality gap (for every k and B > 1) for the 
natural LP relaxation. Previously, Pritchard |[25l studied /c-CS-PIP when B > k and obtained a ratio of 
(1 + k/B)/(l — k/B); in contrast, we obtain improved approximation ratios even when B = 2. 

Theorem 4.1. There is a ^4e • ((e + o(l)) [B\ /c) 1 ^' 8 ^ -approximation algorithm for /c-CS-PIP, and 

j^Tj- • ((e + o(l)) [B\ k) 1 -approximation algorithm for maximizing any monotone submodular 
function over k-column sparse packing constraints. 

It will be convenient to assume that the entries are scaled so that for every constraint j G [m], 
maxjgp^) = 1. So B = min j6 [ ro ] Cj > 1. 

Set a := 4e • (|_-BJ /c) 1 ^- 8 -!. The algorithm first solves the natural LP relaxation for /c-CS-PIP to 
obtain fractional solution x. Then it proceeds as follows. 

1. Sample each item i G [n] independently with probability xi/a. 
Let S denote the set of chosen items. 

2. Define new sizes as follows: for every item i and constraint j G N(i), round up to tij G {2~ a \ 
a G the next larger power of 2. 

3. For any item i and constraint j G N(i), let E{j denote the event that the items {i' G S | t^/j > 
Uj} have total i-size (in constraint j) exceeding one. Mark i for deletion if Eij occurs for any 

j G N(i). 

4. Return set S' C 5 consisting of all items i G 5 not marked for deletion. 

Note the differences from the algorithm in Section |2j the scaling factor for randomized rounding is 
smaller, and the alteration step is more intricate (it uses slightly modified sizes). It is clear that S' is a 
feasible solution with probability one, since the original s-sizes are at most the new t-sizes. 

The approximation guarantee is proved using the following theorem. 

/ . \ k 

Theorem 4.2. For each i G [n], probability Pr[z G S' | i G S] > I 1 



k[B\ 



Proof. Fix any i G [n] and j G N(i). Recall that Eij is the event that items {%' G S \ tyj > t^} have 
total t-size (in constraint j) greater than Cj. 

We first bound ~Px[Eij \ i G S]. Let = 2~ e , where £ G N. Observe that all the t-sizes that are at 
least 2~ l are actually integral multiples of 2~ e (since they are all powers of two). Let = {i' G [n] \ 
U'j > Uj} \ an d Yij := ^i'eXij ti'j " VeS where li'gs are indicator random variables. The previous 
observation implies that Yij is always an integral multiple of 2~ e . Note that 

Pr[£ 4j | i G S] = Pr >Cj-2~ l \ieS < Pr K tj > [ Cj \ - 2~ e \ i G S 

= PT[Yij>[cj\ \ieS], 
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where the last equality uses the fact that Yij is always a multiple of 2 . Since each item is included 
into S independently, we also have Pr[Yy > [cj\ \ i G S] = Pr[Yy > [cj\]. Now Yy is the sum of 
independent [0, 1] random variables with mean: 



x 2 2 
E[Yij] = ^2 k'j ■ Pr[i' G S] < ^2 U>j ■ < — Si>j ■ Xi> < —cj. 



i'=i 



t'=i 



Choose 5 such that (5 + 1) • -E^j] = [cjj, i.e. (using cj > 1), 



5 + 1 



E[Yij] 

Now using Chernoff Bound 11231 . we have: 



> 



Q L c jJ 
2- Cj 



a 

> -. 

- 4 



L«*J 



Pr[r«> N ] = Pr[y«>(i + i ). £| y y ]]<^j <(^j <(il) lSJ . 

The last inequality uses the fact that Cj > B. Finally, by the choice of a = 4e • ( |_-BJ /c) 1 /^ , 

Pr[£? y | i G 5] < Pr[y 4j > [^J] < 



(18) 



As in the proof of Theorem l2.8l for any fixed item i G [n], the conditional events {E^,- | i G 5} j6 jv(j) 
are positively correlated. Thus using (PT8T ) and the FKG inequality [H, 



Pr[i G 5' | i G 5] = Pr /\ -i^- | i G 5 
This completes the proof of the theorem. 



> II Pr W * e ^ f 1 " jnWV 



□ 



As a function of A;, we obtain that Pr[z G S' \ i G S) > (e + o(l)) 1/[BJ . Since Pr[t G S] = xt/a, 
we obtain the first part of Theorem 14. II 

This algorithm can also be used for maximizing monotone submodular functions over such packing 
constraints (parameterized by k and B). Again we would first (approximately) solve the continuous 
relaxation using ||29l , and perform the above randomized rounding and alteration. Corollary I3.4l can be 

used with Theorem l4.2l to obtain a ■ ((e + o(l)) [B\ k) 1 ^ B ^ -approximation algorithm. 
This completes the proof of Theorem 14. II 



4.1 Integrality Gap for General B 

We show that the natural LP relaxation for fc-CS-PIP has an VL{k l /^ B ^ ) integrality gap for every B > 1, 
matching the above approximation ratio up to constant factors. 

For any B > 1, let t := [B\ . We construct an instance of /c-CS-PIP with n columns and m = ( t 
constraints. For all i G [n], weight Wi = 1. 

For every (t + l)-subset C C [n], there is a constraint j(C) involving the variables in C: set 
s i,j(C) = 1 f° r ai l i £ C, and Sijrcf) = for f C. For each constraint j G [rrt], the capacity c, = -B. 
Note that the column sparsity k = ) — (we/t)*. 

Setting Xi = \ for all i G [n] is a feasible fractional solution. Indeed, each constraint is occupied to 
extent < < B (since 2? > 1). Thus the optimal LP value is at least §. 

On the other hand, the optimal integral solution has value at most t. Suppose for contradiction that 
the solution contains some t + 1 items, indexed by C C [n]. Then consider the constraint j(C), which 
is occupied to extent t + 1 = [2? J + 1 > 5, this contradicts the feasibility of the solution! Thus the 
integral optimum is t, and the integrality gap for this instance is at least ^ > k 1 ^ 3 ^ . 
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